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Introduction 


During the first outbreak of severe acute respiratory syndrome (SARS) in 
early 2003, almost 8500 people worldwide were diagnosed with the disease, 
and approximately 800 died of it. The epidemic was contained by July, 2003, 
but since then, the disease has re-emerged in the Guangdong province in 
Southern China around the turn of the year 2003, although the number of 
cases was very limited this time. In addition, there were two cases of SARS 
caused by improper sample handling in research laboratories in Singapore 
and Taiwan. A “mini-outbreak” originated from several independent labo- 
ratory infections at a research laboratory in Beijing in March and April, 
2004, and led to infection chains encompassing three generations. This time, 
China escaped another epidemic since fortunately, the number of diseased 
people was limited to nine, although there was one fatality. In any case, 
these isolated mini-outbreaks remind us of the global threat that SARS still 
represents. 

In late March, 2003, a new coronavirus, SARS-CoV, was isolated from 
affected tissue of SARS patients and identified as the cause of the disease 
[1,2]. Coronaviridae are RNA viruses that have been united with the fam- 
ilies Arteriviridae and Roniviridae in the order Nidovirales. The phyloge- 
netic relationship of these morphologically unrelated virus families is based 
on their similar polycistronic genome organization, common transcription- 
al and post-translational strategies, and a conserved array of homologous 
domains in the viral polyproteins [3-5]. Coronaviruses are responsible for a 
number of diseases of man and animals, some of which have major eco- 
nomical impacts, although the human coronaviruses have clearly been neg- 
lected before SARS emerged. Transmissible gastroenteritis virus (TGEV) 
infection is associated with severe and often fatal diarrhoea in young pigs 
[6, 7]. The human coronaviruses (HCoV) 229E and OC43 are major causes 
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of upper respiratory tract illness in humans, in particular, common cold-like 
conditions [8]. To date, only the 229E strain of HCoV has been character- 
ized in detail because until the discovery of SARS-CoV, it used to be the 
only isolate to grow efficiently in cell culture. In contrast to the relatively 
harmless disease caused by these human coronaviruses, SARS is character- 
ized by high fever, malaise, rigor, headache, non-productive cough, or dysp- 
nea and may progress to generalized, interstitial infiltrates in the lung, 
requiring intubation and mechanical ventilation [9]. The infection, howev- 
er, is not restricted to the lungs. Other affected organs include the kidneys 
and the liver, as well as the gastrointestinal tract; hence SARS can be con- 
sidered a systemic disease. Epidemiological evidence suggests that the 
transmission of this newly emerging pathogen occurs mainly by face-to-face 
contact, although other routes of transmission cannot be fully excluded. 


Coronavirus main proteinase: function and overall structure 


Coronaviruses are positive-stranded RNA viruses featuring a large 
(27-31 kB), single-stranded RNA genome. The replicase gene, encompass- 
ing more than 20,000 nucleotides, encodes two overlapping polyproteins, 
ppla (replicase 1a, ~ 450 kDa) and pplab (replicase lab, ~750 kDa) [10] 
that mediate all the functions required for viral replication and transcrip- 
tion [11]. Expression of the 3'-proximal portion of the replicase gene 
requires (—1) ribosomal frameshifting [10]. The functional non-structural 
proteins are released from the viral polyproteins by extensive proteolytic 
processing. The vast majority of cleavages are mediated by the viral main 
proteinase, MP'° [12], a cysteine proteinase which is frequently also called 
3C-like proteinase (3CL?"°) to indicate a similarity of its substrate speci- 
ficity to that observed for picornavirus 3C proteinases (3CP'°) (see Tab. 1), 
although the structural similarities between the two families of proteinases 
are limited [13]. The MP'° (3CL?"°) cleaves the polyprotein at no less than 
11 conserved sites involving Leu-Gln| (Ser,Ala,Gly) sequences, a process 
initiated by the enzyme’s own autolytic cleavage from ppla and pplab [14, 
15]. Importantly, this cleavage pattern appears to be conserved in the MP'° 
from SARS coronavirus (SARS-CoV), as deduced from the genomic 
sequence [16, 17] and shown experimentally for one cleavage site [18]. The 
SARS-CoV polyproteins have three non-canonical MP"° cleavage sites with 
Phe, Met or Val in the P2 position, but the same cleavage sites are unusual 
in other coronaviruses as well. The functional importance of MP¥° in the 
viral life cycle makes this proteinase an attractive target for the develop- 
ment of drugs directed against SARS and other coronavirus infections. 
Our laboratories have determined crystal structures for HCoV strain 
229E MP at 2.54 A resolution [18], porcine TGEV MP" at 1.96 A [13], and 
SARS-CoV MP? (1.9 A resolution) [19]. In addition, we have also eluci- 
dated the structures of substrate-analogous hexapeptidyl chloromethyl 
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Table 1: Autocleavage sites of coronavirus MP'° and picornavirus 3CP"° 


P6 PS P4 P3 P2 Pl Pr? P2’ P3’ PY 


HCoV MP!° 

N-terminal Y¥ G S aT 1 Q A G iL; R 
C-terminal F G Vv N L Q S G K T 
TGEV MPr° 

N-terminal Vv N S T L Q S G L R 
C-terminal Y G Vv N L Q A G K Vv 
MHV MP? 

N-terminal T T S F L Q S G I Vv 
C-terminal A G Vv K L Q S K R Vv 
SARS-CoV MP?° 

N-terminal aly S A Vv L Q S G F R 
C-terminal S G Vv TT: F Q G K F I 
HRV-2 3CP?° 

N-terminal L P Vv Vv Vv Q G P N T 
C-terminal ¥ F T D Vv Q G Q I T 
HAV 3CP?° 

N-terminal K G L F S Q A K I S 
C-terminal M E Ts R T Q S F S N 
PV 3CPr° 

N-terminal E T A K Vv Q G P G F 
C-terminal G A Vv al E Q G Y: L N 
Consensus 

Coronavirus MP'° — - small - L Q © small - - - 
Picornavirus 3CP"° — - ) - small QO (G) - - - 


Cleavage site specificities of coronavirus MP's and picornavirus 3CP"°s are similar but differ- 
ent in detail. N- and C-terminal autocleavage sites of MP's and 3CP¥°s are shown. HCoV, 
human coronavirus 229E; TGEV, porcine transmissible gastroenteritis virus; MHV, mouse 
hepatitis virus; SARS-CoV, SARS coronavirus; HRV 2, human rhinovirus serotype 2; HAV, 
hepatitis A virus; PV, poliovirus type 1 (strain Mahoney); », hydrophobic 


ketone inhibitor complexes of the TGEV and SARS-CoV enzymes, at 
around 2.4 A resolution [18, 19]. The structures of all three enzymes show 
that the coronavirus main proteinase consists of three domains, the first two 
of which together distantly resemble chymotrypsin while the third has a 
unique a-helical fold (Fig. 1). Located between domains I and I, the active 
site comprises a Cys-His catalytic dyad, rather than the usual triad. 
Domains | and II (residues 8-99 and 100-183, respectively) are six-stranded 
antiparallel B-barrels. A long loop (residues 184 to 199) connects domain II 
to the C-terminal domain (domain III, residues 200-300), which is a globu- 
lar cluster of five, mostly antiparallel helices. 

The interior of the B-barrel of domain I consists entirely of hydrophobic 
residues. A short a-helix (helix A; residues 53-58) closes the barrel like a 
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Figure 1. Domain structure of the coronavirus main proteinase monomer (shown here is 
TGEV MP", 


lid. Domain II is smaller than domain I and also smaller than the homolo- 
gous domain II of chymotrypsin and the 3C proteinase (3CP'°) of hepatitis 
A virus (HAV) [20-22]. Several secondary structure elements of this 
domain in HAV 3C?'° (strands b2II and cI and the intervening loop) are 
missing in the coronavirus MP'°. The domain II barrel of the MP'° is far from 
perfect (Fig. 1). The segment from Gly135 to Ser146 forms a part of the bar- 
rel, even though it consists mostly of consecutive loops and turns. In fact, in 
contrast to domain I, a structural alignment of domain II onto the picor- 
naviral 3C proteinases has proven difficult. The superposition of domains I 
and II of the TGEV MP’? onto those of the HAV 3C?'° yields a root mean 
square deviation (r.m.s.d.) of 1.85 A for 114 equivalent (out of 184 com- 
pared) C,, pairs, while domain II alone displays an r.m.s.d. of 3.25 A for 57 
(out of 85) C, pairs. 


Coronavirus main proteinase: target for antiviral drug therapy 177 


The three MP'° structures determined so far are quite similar to one 
another. The overall r.m.s. deviations for Ca atoms between the structures 
is below 2 A for all 300 Ca positions of the molecules that could be locat- 
ed in the electron density maps but the isolated domains exhibit signifi- 
cantly smaller deviations. The most variable domain appears to be the hel- 
ical domain III, which shows a better spatial overlap between TGEV MPr° 
and HCoV MP’° than between each of these and SARS-CoV MP"°. To a cer- 
tain extent, these numbers reflect the degree of sequence identity between 
the different MP's. HCoV 229E and TGEV, both being group I coron- 
aviruses, share 61% sequence identity between their main proteinases, 
whereas SARS-CoV as a distant outlier of group II has 40% and 44%, 
respectively, of the sequence of its MPT? in common with HCoV 229E and 
TGEV [18]. Interestingly, domains I and II show a higher degree of 
sequence conservation (42-48% identity) than domain III (36-40%) 
between SARS-CoV MP’? and the coronavirus group I enzymes. 


Dimer formation 


The crystal structure of the TGEV MP"? provided the first indication that 
coronavirus main proteinases form tight dimers [13]. This has since been 
confirmed by the X-ray structures of the HCoV 229E and SARS-CoV 
MP"°s (Fig. 2) [18, 19]. Dimerisation also occurs in solution: we could show 
by dynamic light scattering that both HCoV 229E and TGEV MP" exist as 
a mixture of monomers (~65%) and dimers (~35%) in diluted solutions (1- 
2 mg proteinase/ml). Lai and coworkers have demonstrated that the enzy- 
matic activity of the SARS-CoV MP’? increases linearly with enzyme con- 
centration, suggesting that the enzymatically active species is the dimer and 
not the monomer [23]. Crystal structures of the SARS-CoV enzyme at dif- 
ferent pH values provided an explanation for this observation [19] (see 
below). 

It should be noted that dimer formation as a prerequisite for enzymatic 
activity makes coronavirus main proteinases unique amongst the pro- 
teinases of RNA viruses, and present a clear distinction from the 3C pro- 
teinases of picornaviruses. The TGEV MP"? X-ray structure showed, and the 
other coronavirus MP" structures confirmed, that the monomer-monomer 
contact interface comprises two regions, i.e. i) the N-terminal eight residues 
of molecule A and domain II of molecule B, and ii) the domains III of each 
protomer. Although the latter part of the interface is significantly smaller 
than the former, it is apparently sufficient to hold the isolated domains III 
together in solution, as has recently been shown for SARS-CoV MP"? [24]. 
The crystal structures show that the two molecules are oriented perpendi- 
cular to one another (Fig. 2) [13, 18, 19]. Since the architecture of the dimers 
including most details of intermolecular interaction are the same in TGEV 
MP’e — three independent dimers per asymmetric unit — and HCoV 229E as 
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Figure 2. The SARS-CoV main protease dimer. 


well as SARS-CoV MP'°s — both with one dimer per asymmetric unit -, 1.e., 
in completely different crystalline environments, there can be little doubt 
that dimer formation is of biological relevance in these enzymes. 

In the MPr° dimer, the N-terminal amino acid residues are squeezed in 
between domains II and III of the parent monomer - designated “A” in this 
paragraph — and domain II of the other monomer (“B”), where they make 
a number of very specific interactions that appear tailor-made to bind this 
segment with high affinity. These interactions include intermolecular salt 
bridges between the positively charged amino terminus (residue Al) and 
GluB165 (all residue numbers given in this section are for TGEV MP"°) of 
domain II of the neighboring molecule in the dimer, between ArgA4 and 
GluB286, as well as an intramolecular salt bridge between LysA5 and 
GluA291 of domain III of the parent molecule. Also, there is an important 
hydrophobic contact between the semiconserved LeuA3 and a hydropho- 
bic patch on domain III which includes PheA206, AlaA209, PheA287, 
ValA292, the C? atom of GInA295, and MetA296. All sequenced members 
of the coronavirus proteinase family have a hydrophobic residue in position 
3 (SARS-CoV has Phe), while glycine is absolutely conserved in position 2. 
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The latter residue adopts the a; conformation, which is easily accessible 
only to glycine. This conformation ensures that the N-terminal segment fits 
into its narrow binding canyon in the dimerization interface. 

The existence of a specific binding site for the N-terminal segment of the 
MPre could be important in the activation of the main proteinase by cat- 
alyzing its release from the viral polyprotein. The free N-terminus is creat- 
ed as a consequence of the first cleavage reaction performed by the pro- 
teinase, at its N-terminal autocleavage site. In the mature MP'° monomer, 
the first eight residues from the N-terminus are not a part of the globular 
protein structure. After the autocleavage reaction, they could easily remain 
bound to the substrate-binding site of the proteinase if an alternative high- 
affinity binding site was not provided by dimer formation. The existence of 
such high-affinity binding site ensures that the catalytic site is immediately 
liberated from the product and can act on other cleavage sites in the 
polyprotein. However, the exact placement of the amino terminus also has 
a very important structural role for the mature MP", and is in fact critical 
for its catalytic activity, as we will see below. 


Catalytic site 


In the active site of the MP", Cys144(145) and His41 (SARS-CoV number- 
ing in brackets where different) form a catalytic dyad. In contrast to serine 
proteinases and other cysteine proteinases, which have a catalytic triad, 
there is no third catalytic residue present. HCoV 229E and TGEV MPr°s 
have Val84 in the corresponding sequence position (Cys in SARS-CoV 
MP'°), with its side chain pointing away from the active site. A buried water 
molecule is found in the place that would normally be occupied by the third 
member of the triad; this water is hydrogen-bonded to His41 N61, GIn163 
Ne2, and Asp186 O61 in HCoV 229E MP"? (His, His, and Asp in both SARS- 
CoV and TGEV MP'°). Asp186(187) does not act on the active-site His41 
through the mentioned water molecule, but rather interacts with conserved 
Arg40 by forming a strong inter-domain salt bridge. 

There has been a long-standing debate on the exact nature of the cat- 
alytic centre in cysteine proteinases. Polgar has shown that in papain-like 
cysteine proteinases, the active species is a thiolate-imidazolium ion pair, 
with the imidazolium cation held in the correct position by an asparagine 
residue, the third residue of the catalytic triad [25]. However, such a mech- 
anism does not seem to be predominant in the coronavirus main proteinas- 
es; Lai and coworkers have recently shown that if the active-site cysteine 
residue of SARS-CoV MP"? is replaced by serine, some residual enzymatic 
activity — decreased by a factor of 40 —can still be detected. This can be seen 
as an indication for the existence of a classical general base mechanism in 
the wild-type proteinase, involving the non-ionized Cys145 hydrogen bond- 
ed to the uncharged imidazole of His41 in the ground state of the free 
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enzyme [26]. We note that in viral cysteine proteinases, both picornaviral 
2A and 3C proteinases as well as coronaviral main proteinases, the Sy of the 
active-site cysteine residue is exactly in the plane of the histidine imidazole 
ring, at a distance of between 3.5 and 4.0 A, while in the papain-like cysteine 
proteinases, this is not the case. For hydrogen bonding to occur between the 
two side chains, but not for ion-pair formation, the sulfur has to be more or 
less in the plane of the imidazole. While this preliminary observation has to 
be investigated further, it seems that X-ray crystallography can contribute 
to this important mechanistic question by providing exact geometric details 
of the ground-states of the free enzymes. 


Substrate binding 


Comparison of the structures shows that the substrate-binding sites are well 
conserved among coronavirus main proteinases. This is supported by our 
experimental finding that recombinant SARS-CoV MP'° cleaves a pen- 
tadecapeptide corresponding to the N-terminal autocleavage site of the 
TGEV MP? [18]. 

To allow structure-based design of drugs directed at coronavirus MP'°s, 
we sought to determine the exact binding mode of MP’ substrates. To this 
end, we synthesized the substrate analog hexapeptidyl chloromethyl ketone 
inhibitor, Cbz-Val-Asn-Ser-Thr-Leu-Gln-CMK (subsequently named 
CMK), and soaked it into crystals of TGEV and SARS-CoV MP"? [18, 19]. 
The sequence of the inhibitor was derived from the P6-P1 residues of the 
N-terminal autoprocessing site of TGEV MPre (SARS-CoV MP!° and 
HCoV 229E MP’? have Thr-Ser-Ala-Val-Leu-Gln and Tyr-Gly-Ser-Thr-Leu- 
Gln, respectively, at the corresponding positions). At the outset, we will 
present here the complex between the TGEV MP"? and the inhibitor [18]. 
In the case of the SARS-CoV enzyme, some unusual binding modes were 
observed [19], which we will discuss later. 

X-ray crystallographic analysis of the TGEV MP"? complex at 2.37 A 
resolution revealed difference density for all residues (except the benzy- 
loxycarbonyl (Cbz) protective group) of the inhibitor, in two (B and F) out 
of the six TGEV MP'° monomers in the asymmetric unit [18]. In these 
monomers, there is a covalent bond between the Sy atom of the active-site 
nucleophile Cys144 and the methylene group of the chloromethyl! ketone. 
There are no significant differences between the structures of the enzyme 
in the free and in the complexed state. The substrate-analogue inhibitor 
binds in the shallow substrate-binding site at the surface of the proteinase, 
between domains I and II (Fig. 3A). The residues Val-Asn-Ser-Thr-Leu-Gln 
occupy, and thereby define, the subsites S6 to S1 of the proteinase. Residues 
PS to P3 form an antiparallel B-sheet with segment 164-167 of the long 
strand ell on one side, and they also interact with segment 189-191 of the 
loop linking domains IJ and III on the other. In the discussion that follows, 
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residue numbers for TGEV MP" are followed by those for the SARS-CoV 
enzyme in brackets. 


SI specificity site 


In coronavirus MP"? polyprotein cleavage sites, the P1 position is invariably 
occupied by Gln. At the very bottom of the MP'° S1 subsite, the imidazole 
of His162(163) is suitably positioned to donate a hydrogen bond from Ne2 
to the side-chain carbonyl of the P1 glutamine (Fig. 3A). In order to specif- 
ically bind glutamine and not glutamate in the S1 pocket, this histidine 
residue is required to be uncharged over a broad pH range. This appears to 
be maintained by two important interactions made by the imidazole ring: i) 
stacking onto the phenyl ring of Phe139(140), and ii) accepting a hydrogen 
bond (to N61) from the hydroxyl group of the buried Tyr160(161). In agree- 
ment with this structural interpretation, any replacement of His162 com- 
pletely abolishes the proteolytic activity of HCoV 229E and feline infec- 
tious peritonitis virus (FIPV) MP’? [27, 28]. Furthermore, FIPV MP'° Tyr160 
mutants have their proteolytic activity reduced by a factor of >30 [27]. All 
of these residues are conserved in SARS-CoV MP"? and, in fact, in all coro- 
navirus main proteinases. Other elements involved in the S1 pocket of the 
MP" are the main-chain atoms of Ile51, Leul64(Met165), Glu165(166), and 
His171(172). The latter two residues form a salt bridge at the outer wall of 
the S1 specificity pocket in the TGEV and HCoV 229E MP"°s, as well as in 
one conformation of the SARS-CoV MP’°. However, the latter displays 
another conformation which will be discussed below. 

In all structures of free coronavirus main proteinases that we deter- 
mined so far, we find electron density at the bottom of the S1 pocket. We 
interpreted this as a water molecule or, in the case of TGEV MP" where the 
density was more dumb-bell-shaped, as two water molecules. However, 
especially in the latter case, we realized that apart from a possible hydrogen 
bond between one of these waters and the His162 imidazole, no interac- 
tions with the proteins were made by these water molecules. Also, the posi- 
tions of the two water molecules were in fact too close to one another 
(about 2.1-2.3 A) [13]. When we subsequently determined the structure of 
the complex between the TGEV MP" and the CMK inhibitor, we noted 
that the oxygen and the nitrogen of the side-chain amide occupied a posi- 
tion exactly matching that of the two water molecules in the free enzyme. 
It is therefore possible that the free enzyme in fact also contains some 
residual peptide cleavage product (perhaps from an unidentified E. coli 
protein). Having realized this possibility, we checked for residual difference 
electron density in other subsites but could find nothing, with the exception 
of a patch of relatively strong electron density at the P2 position which we 
had interpreted and successfully refined as a 2-methyl-2,4-pentandiol 
(MPD) molecule from the crystallization medium [13]. The MPD does not 
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fully protrude into the S2 pocket but rather occupies a position correspon- 
ding to the main chain of the peptide substrate. At this point, we still think 
the assignment of this density was correct, but studies on other proteinases 
— e.g. thermolysin — that revealed the presence of cleavage product after 
years of investigation taught us to be cautious. 


S2 specificity site 


Apart from a few exceptions, coronavirus MP?° cleavage sites have a Leu 
residue in the P2 position [28]. The hydrophobic S2 subsite of the TGEV 
enzyme is formed by the side chains of Leu164, [le51, Thr47, His41, and 
Tyr53. In addition, residues 186-188 line the S2 subsite with some of their 
main-chain atoms. The P2 Leu side chain of the inhibitor is well accommo- 
dated in this pocket (Fig. 3A). 

In the SARS-CoV MP? [19], the S2 pocket is similar to what has been 
seen in HCoV MPr° [18] and TGEV MP"? [13]. One wall of the subsite is 
formed by methionine residues 49 and 165, which adopt positions spatially 
similar to TGEV and HCoV 229E MP?° residues Ile51 and Leu/Ile164, 
respectively. The longer methionine side chains restrict the S2 pocket in one 
direction and probably require a slight reorientation of the P2 leucine 
residue of the substrate, compared to TGEV MP"°. On the other side, the S2 
pocket appears to be somewhat enlarged in SARS MP", due to the inser- 
tion of Ala46 which is not present in TGEV or HCoV 229E MP'°, as well as 
to many differences in amino acid sequence. In SARS-CoV MP'°, the 
stretch 45-50 forms a short 3),-helix, whereas this region has no regular sec- 
ondary structure in the other coronavirus main proteinases. The net effect 
of these modifications appears to be a widening of the S2 pocket on this 
side. This may be required for the efficient binding of a P2 phenylalanine, 
which is present in the C-terminal autocleavage site of the MPT°. 


Specificity sites beyond SI and S2 


There is no specificity for any particular side chain at the P3 position of 
coronavirus MP'° cleavage sites. This agrees with the P3 side chain of our 


Figure 3. X-ray crystallographic analysis of M?’® complexes with CMK inhibitor (stereo fig- 
ures) (A) The substrate analogue inhibitor binds in the shallow substrate-binding site at the 
surface of the TGEV proteinase, between domains I and II. (B) In the active monomer A of 
the SARS-CoV MP?° crystallized at pH 6.0, the P1 glutamine of the inhibitor protrudes into 
the S1 pocket, but P2-P5 adopt an unusual binding mode. (C) In the inactive monomer B, the 
S1 pocket is not accessible, and the P1 glutamine points towards bulk solvent. 

A: After [18], with kind permission of the American Association for the Advancement of 
Science (AAAS); B, C: After [19], with kind permission of The National Academy of Sciences 
of the USA. 
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substrate analog being oriented towards bulk solvent in the TGEV MPre 
complex (Fig. 3A). At the P4 position, there has to be a small amino acid 
residue such as Ser, Thr, Val, or Pro because of the congested cavity formed 
by the side chains of Leu164, Leu166, and Gln191 as well as the main-chain 
atoms of Ser189. These residues are conserved or conservatively substitut- 
ed (Leul164=> Met165, Ser189=Thr190) in SARS-CoV MP'°. In the TGEV 
MPr° complex, the P5 Asn side chain interacts with the main chain at 
Gly167, Ser189, and Gln191 (Pro168, Thr190, Gln192 in the SARS-CoV 
enzyme), thus involving the loop linking domains IJ and III, whereas the P6 
Val residue is not in contact with the protein (Fig. 3A). 

Although our CMK inhibitor does not include a P1' residue, it is easily 
seen that the common small P1' residues (Ser, Ala, or Gly) can be easily 
accommodated in the S1' subsite of TGEV MP?° formed by Leu27, His41, 
and Thr47, with the latter two residues also being involved in the S2 subsite 
(Leu, His, and Asp in SARS-CoV MP'°). 

Superimposition of the structures of the T@EV MP'-CMK complex and 
the free HCoV 229E MP"? shows that the two substrate-binding sites are 
basically the same. All residues along the S4-S1 side of the cleft are identi- 
cal, with the exception of the conservative M190L replacement (Ala191 in 
SARS-CoV MP?°). In the SARS pathogen, MP'° residues 168 and 188 - 190 
show some substitutions but since these residues contribute to substrate 
binding with their main-chain atoms only, the identity of the side chains is 
less important. Indeed, the substrate-binding site of the SARS-CoV Mpre 
matches those of its TGEV and HCoV 229E counterparts quite well (Fig. 
3B). And yet, the CMK inhibitor bound to the SARS-CoV MP'° in a 
remarkably different mode (Figs. 3B,C). But before we discuss substrate or 
inhibitor binding to the SARS-CoV main proteinase, we have to analyze 
the dynamical nature of the structure of the free enzyme. 


pH-dependent conformational changes of the SARS-CoV MPr° 


The SARS-CoV MP’? also forms a dimer in the crystal (and in solutions at 
concentrations of >1 mg/ml) with the two protomers oriented almost at 
right angles to each other [19], in an arrangement that is similar to the 
HCoV and TGEV MP"° structures [13, 18]. However, when we first deter- 
mined the structure of the SARS-CoV enzyme by X-ray analysis of crystals 
grown at pH 6.0 [19], we were surprised to see that the substrate-binding 
site of one monomer (denoted “A”, Fig. 4A) was in the normal, active con- 
formation described above for the TGEV and HCoV 229E MP"°s, whereas 
the other (“B”) had adopted a catalytically incompetent form (Fig. 4B). 
There were two major conformational changes that led to this inactivation: 
i) the S1 binding pocket for the glutamine residue at the cleavage site of the 
substrate had collapsed, and ii) so had the the oxyanion hole (Fig. 4B). We 
believe that these changes originate from protonation of HisB163, the con- 
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Figure 4. Conformational changes of SARS-CoV MP?°. (A) Substrate-binding site of one 
monomer in the normal, active conformation; (B) catalytically incompetent form; (C) Schem- 
atic drawing showing the S1 pocket in the active conformation (top) and the inactive confor- 
mation (bottom). After [19], with kind permission of The National Academy of Sciences of the 
USA. 


served residue found at the bottom of the S1 pocket, in monomer B. The pH 
of crystallization of 6.0 is close to the pK of histidine, explaining why the 
residue appears to be protonated in monomer “B” but not monomer “A” of 
the SARS-CoV MPr° dimer. As a consequence of the protonation, the side 
chain of PheB140, which stacks onto HisB163 as long as the latter is unpro- 
tonated, moves away and reorients towards the A3 of the other monomer. 
In order to neutralize the positive charge of HisB163 in its relatively 
hydrophobic environment, GluB166 moves into the S1 pocket and forms a 
very strong salt bridge of 2.43 A with HisB163. This leads to breakage of the 
ion pair between GluB166 and HisB172 as well as of the amino terminus, 
residue Al, of the other monomer in the dimer. This very amino terminus 
also interacts with the main chain of residue B140, the phenylalanine that 
moves away in response to the protonation of HisB163. This interaction, 
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Figure 5. Activity of the SARS-CoV MP"° as a function of pH. 


consisting of two hydrogen bonds, between the hydrogen bond with the 
amino group of Al and the CO of PheB140 as well as between the CO of 
Al and the NH of PheB140, is also lost as part of the conformational 
rearrangements. As a result, the segment around PheB140 (residues B138 - 
B143) is no longer fixed. It adopts a different conformation with positional 
differences from the A chain of up to 5.9 A and becomes poorly ordered, 
although we do see continuous electron density for the main chain and for 
all side chains except PheB140 (Fig. 4B). In fact, the entire loop translates 
towards the active site. Importantly, residue 140 is part of the oxyanion 
loop, which stabilizes the transition state of the proteolytic reaction by 
donating hydrogen bonds from the main-chain amides of Gly143 and 
Cys145 to the negatively charged oxygen of the tetrahedral intermediate. It 
is essential that this loop be in the right conformation for the enzyme to 
exhibit full catalytic activity. However, in the inactive protomer, Gly143 
moves by >3.2 A towards the active site and leaves no space for the oxyan- 
ion to bind. 

As mentioned above, the S1 binding site is partly occupied by GluB166 
interacting with HisB163. In addition, CysB145 adopts two conformations, 
each about half occupied, one of which is filling the remainder of the col- 
lapsed S1 pocket. The other one is interacting with the catalytic HisB41 as 
in the active protomer. In summary, protomer “B” in the dimer is inactive 
because it cannot stabilize the oxyanion transition state and because it is 
unable to bind the P1 glutamine residue of the substrate. This agrees with 
the measured activity of the enzyme as dependent on pH, which displays a 
bell-shaped curve, with a maximum at pH 7.0 and less than 50% activity 
both at pH 6.0 and at pH 8.5 (Fig. 5) [29]. 

In a follow-up study, we equilibrated at pH 7.6 and 8.0 crystals of the 
SARS-CoV MP"° that had been grown at pH 6.0, and determined the struc- 
tures by X-ray analysis [19]. At pH 7.6 (Fig. 6A), the S1 binding pocket is 
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Figure 6. The S1 pocket of SARS-CoV MP"° equilibrated at pH 7.6 (A) and pH 8.0 (B). Left 
panels: molecule A, right panels: molecule B. 


fully formed in both monomers and Phe140 is interacting with the now 
unprotonated His163. The oxyanion is correctly shaped. Glu166 forms ion 
pairs with His172 and the N-terminus of the other monomer in the dimer. 
At pH 8.0 (Fig. 6B), another interesting change takes place. The salt bridge 
between Glu166 and His172 breaks — but not the one between Glu166 and 
the amino terminus of the other protomer in the dimer -, resulting in 
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Figure 7. Schematic drawing of the SARS-CoV MP'° dimer as seen in the crystal structure at 
pH 6.0. Note the important role of the “N-finger”. 


Glu166 becoming more mobile and partly filling the S1 pocket, thereby 
decreasing the activity of the enzyme. This is presumably due to the depro- 
tonation of His172 at the high pH value. The pK value of histidine residues 
involved in salt bridges has been shown to be around 8.0 to 8.5, fully con- 
sistent with our interpretation of the phenomenon. Thus, we propose that 
the low-pH and high-pH flanks of the bell-shaped pH-activity curve (Fig. 5) 
are determined by the protonation and deprotonation of His163 and 
His172, respectively. In this respect, our interpretation differs from that of 
Lai and coworkers who proposed that deactivation of the enzyme at high 
pH is due to deprotonation of the active-site cysteine [26]. In our opinion, 
the latter would only play a role at even higher pH values of >9. 

The different protonation states of His163 and His172 and their influ- 
ence on the conformation of the substrate-binding site were further 
explored by molecular-dynamics calculations over 10 ns each (Tan et al., in 
preparation). The results are in full agreement with the interpretation of the 
crystal structures put forward here. Furthermore, molecular dynamics cal- 
culations have the advantage that they can simulate states of the system 
under investigation that are not easily accessible to experiment. A dynam- 
ics run at pH 5.0, with His163 protonated in both monomers of the dimer, 
revealed a tendency of the two S1 pockets to collapse, with Glu166 switch- 
ing between the protonated His163 and His172 residues over the time of 
the simulation. 

It is important to note that none of the conformational differences seen 
between monomers “A” and “B” of the SARS-CoV MP? when crystallized 
at pH 6.0 is due to crystal packing artefacts. Symmetry-related contacts 
between dimers mainly involve domain HI of both molecules; the only 
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residues in domains I and IJ to display strong crystal contacts are His A134, 
Asn B53, Asp B56, Ile B59, Arg B60, His B134. Furthermore, the fact that 
the conformation of monomer B flips back to the normal, active one with- 
in the same crystal lattice after equilibration of the crystals at higher pH 
values, proves that crystal packing does not play a role here. 

The amino terminus of the SARS-CoV MP’? plays an important role in 
our structural model, by interacting with essential residues of the substrate- 
binding site of the other monomer in the dimer. Like a finger, the N-termi- 
nus docks into its specific binding site (Fig. 7) and flicks on the enzymatic 
activity of the other monomer. Hydrogen bonding from the “N-finger” to 
the main chain of Phe140 helps shape the oxyanion loop, and ion-pair for- 
mation with Glu166 prevents the latter from blocking the S1 specificity 
pocket. If a free N-terminus is not available at position 1 of the MP?°, such 
as in the maltose-binding protein (MBP) fusion protein or in the presence 
of an N-terminal histidine tag, the enzymatic activity should be reduced and 
this has indeed been observed, at least with the MBP fusion protein [13]. In 
order to further demonstrate the importance of the N terminus, we deleted 
residues 1 to 5 in the TGEV MP’? and indeed found the enzyme to be 
almost totally inactive [13]. This observation also solves the mystery of the 
importance of the “extra” helical domain III of the main proteinase. Its task 
is to provide defined interaction points for the N-terminal segment and 
thereby locating the tip of the N-finger to its specific interaction site with 
Phe140 and Glu166 of the other monomer. Accordingly, deletion of domain 
II also results in an inactive enzyme [13]. The enzyme is dimeric because 
of the essential role played by the N-finger, but the N-terminal segment is 
not essential for dimerization, as shown recently by Shi et al. [24] and also 
indicated by steered molecular dynamics calculations of aSARS-CoV MPre 
dimer devoid of the N-terminal residues (Tan et al., personal communica- 
tion). 

A transition between a catalytically competent and an incompetent con- 
formation of the oxyanion-binding site has also been seen in the crystal 
structure of equine arterivirus main proteinase (nsp4), where the loop 
structurally equivalent to the MP'° 139-143 loop undergoes a peptide flip, 
resulting in misorientation of the amide corresponding to Gly143 [30]. Also, 
we have little doubt that the same pH-induced conformational changes that 
we observed in the SARS-CoV MP'° does exist in all other coronavirus 
main proteinases. We simply failed to detect it earlier because we had crys- 
tallized the MPs of TGEV and HCoV 229E at pH >8.0, ie. with both 
monomers adopting an active conformation [13, 18]. 

In conclusion, SARS-CoV MP"° appears to possess a pH-triggered acti- 
vation switch, which may regulate proteolytic activity in different cellular 
compartments. Such a pH switch would provide the virus with a mechanism 
to synchronize the polyprotein processing with the microenvironment. 
Further mutagenesis and biochemical studies on the details of the switch 
are clearly desirable. 
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Self-activation of the main proteinases 


The MP?° is part of the viral polyprotein itself and has to be activated by 
autocleavage from this inactive precursor before it can cleave the polypro- 
tein at other cleavage sites. So far, it has been completely unclear how this 
autoactivation might occur but the crystallographic work on SARS-CoV 
MP’? now provides a clue. The pH-dependent conformational changes seen 
in the X-ray structures [19] could well correlate with the structural arrange- 
ments occurring during self-activation of the enzyme. 

Moreover, a comparison with the transition between chymotrypsin and 
its proenzyme, chymotrypsinogen, reveals a remarkable similarity in the 
mechanisms used. In mature chymotrypsin, the charged amino terminus, 
created after cleavage of the proenzyme at the Arg15-e16 bond, forms an 
ion pair with Asp194, a residue involved in forming the wall of the S1 speci- 
ficity pocket. In chymotrypsinogen, on the other hand, the N-terminal 
extension of the polypeptide chain by 15 residues causes disruption of this 
ion pair [31]. The Asp194 side chain changes conformation and so does, 
along with it, Gly193, whose amide forms part of the oxyanion hole. As a 
consequence, interaction of Gly193, which corresponds to Gly143 of SARS- 
CoV MPr°, with the tetrahedral intermediate is no longer possible. In 
SARS-CoV and other coronaviruses, the polyprotein upstream to the N- 
terminal autocleavage site of the MP'° corresponds to the propeptide in 
chymotrypsinogen. The role of Asp194 of the latter is taken by Glu166 in 
SARS-CoV MP'° and its proenzyme, and the oxyanion-binding region 
changes in a similar way. The major difference between the two enzymes is 
that the event is intramolecular in chymotrypsin but intermolecular in 
SARS-CoV MP9, 

It is a matter of debate whether the autocleavage of the MP'°s from the 
polyprotein precursors occur in cis or in trans. In the case of mouse hepati- 
tis (corona)virus (MHV) MP?°, autocleavage has exclusively been observed 
in trans [28]. On the other hand, it has been shown for several picornavirus 
3C proteinases, e.g. hepatitis A virus 3C?'° [32], that the first step of auto- 
cleavage of the enzyme, i.e. processing at the N-terminal cleavage site, occurs 
in cis, followed by other cleavage reactions in trans. Molecular modeling 
studies based on the TGEV MP" crystal structure suggested that N-terminal 
precursor cleavage in cis should be possible, because by introducing a bend 
around residues 11 to 13, the extended N-terminal chain can be quite easily 
folded back into the active site of its own molecule, even though the distance 
between the amino terminus (A1) and CysA145 Sy is about 34 A [13]. On 
the other hand, bringing the N terminus from the position seen in the crys- 
tal structure into the catalytic site of the other molecule in the dimer proves 
quite difficult, even though the distance is only about 11 A. Thus, the fact 
that the MP'°s are dimers does not necessarily support cleavage in trans. The 
SARS-CoV MP"? structure rather suggests a different role for the dimeriza- 
tion, the mutual activation of the subunits by transition from the proenzyme 
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conformation to that of the active enzyme, triggered by binding of N-fingers 
to the activation switch. Clearly, asymmetric dimers consisting of one mole- 
cule in the mature conformation and one in the proenzyme conformation, as 
seen in the SARS-CoV MP" crystal structure at pH 6.0, would be important 
intermediates in this process. Polyprotein processing in SARS-CoV-infected 
cells could thus be initiated by the occasional dimerization of two mem- 
brane-anchored MP'° domains within polyproteins and mutual activation 
through the N-finger tips, or by the rare event of cis-autocleavage of a sin- 
gle MPr° domain which then binds to another one within polyprotein, again 
activating it through the N-finger. The results are highly active MP° dimers 
which can then cleave other substrates in trans. Such a mechanism could 
lead to self-acceleration of MP'° autoprocessing, after a slow onset triggered 
by arare initial event. This mechanism is consistent with the fact that in cells 
infected with HCoV 229E or MHV, long-lived cleavage intermediates are 
usually not observed, but polyprotein is processed with increasing speed, 
once the first cleavage has occurred [28, 33]. 


Complex of the SARS-CoV MP’? with a chloromethyl ketone 
inhibitor 


After dissecting the details of conformational dynamics of the SARS-CoV 
main proteinase, we are now finally in the position to competently discuss 
the interactions of the enzyme with substrate. To visualize these, the sub- 
strate-analoguous hexapeptidyl CMK inhibitor, Cbz-Val-Asn-Ser-Thr-Leu- 
Gln-CMK, was soaked into the crystals of the SARS-CoV enzyme grown at 
pH 6.0, and the structure of the complex was determined at 2.5 A resolu- 
tion [19]. We expected to see one of two possible cases, i.e. either i) the 
inhibitor would only bind to the protomer in the active conformation 
(monomer “A”), or ii) it would bind to the inactive protomer (monomer 
“B”) as well, thereby imposing the active conformation onto the latter by 
an induced-fit mechanism. Quite unexpectedly, neither assumption turned 
out to be true. Instead, whereas the inhibitor does bind to both subunits and 
covalent bonds are being formed between the Sy of Cys145 and the meth- 
ylene group of the CMK, the inhibitor exhibits two different modes of bind- 
ing, each of which is unusual and unexpected (Fig. 3B, C). Protomer “B” 
remains in the inactive conformation, and the inhibitor does not open the 
occluded S1 pocket to allow the P1 Gln to enter the specificity site 
(Fig. 3C). Instead, the P1 Gln points out of the substrate-binding cleft and 
towards bulk solvent. On the other hand, P2 Leu and P4 Ser bind to the 
appropriate specificity pockets that have been observed in the structure of 
TGEV MPre [13], whereas P3 Thr is oriented toward bulk solvent. This 
result can be interpreted as resulting from the high chemical reactivity of 
the CMK moiety with obvious concurrent inability of the inhibitor to over- 
come the protonation of HisB163 at the bottom of the S1 pocket at pH 6.0. 
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Binding of the CMK inhibitor to protomer A, which exhibits the active 
conformation, is even less conventional (Fig. 3B).The P1 glutamine resides 
in the S1 pocket and its side-chain carbonyl accepts a 2.8-A hydrogen 
bond from the Ne2 atom of HisA163, whereas its side-chain amide nitro- 
gen donates a 2.8-A hydrogen bond to the carboxylate of GluA166, ensur- 
ing the specificity for glutamine in the S1 subsite. However, the P2 residue 
fails to penetrate the S2 subsite and becomes partially solvent-accessible. 
This non-canonical binding results in a frameshift in subsite interaction: P3 
Thr occupies the S2 pocket instead and P5 Asn binds at the S4 subsite. We 
note that the P2 residue specificity of SARS-CoV MP?° is less stringent 
than that of other CoV MP"°s, which, with few exceptions, are restricted to 
leucine [28]. The unexpected binding mode of the substrate-like inhibitor 
may well be a consequence of the more relaxed specificity requirements of 
the SARS-CoV MP'°, compared to other coronaviral main proteinases. 
However, before further conclusions are made, it has to be excluded that 
the binding mode observed is due to an artefact, perhaps because the 
sequence of the peptidyl CMK inhibitor had been derived from the TGEV 
MPro and not the SARS-CoV MPr° N-terminal autocleavage site. 
Therefore, we are presently determining the crystal structures of the com- 
plexes of SARS-CoV MP?° with other inhibitors, both peptidic and non- 
peptidic. 


Structure-based design of anti-SARS inhibitors 


When the genome sequence of the SARS coronavirus was published on the 
internet on April 13,2003 [16], we immediately located the segment in orfla 
coding for the main proteinase and aligned the amino acid sequence of the 
enzyme with the MPr°s of other coronaviruses. Next, we constructed a 
homology model of the SARS-CoV MP", based on this alignment and our 
crystal structures of the TGEV and HCoV 229E main proteinases [18]. This 
theoretical model revealed that the substrate-binding site of the SARS- 
CoV enzyme is very similar to those of the two other viral enzymes. At that 
time, less than one month after the discovery of the new SARS virus and at 
the peak of the epidemic in China, this structural model was the best we 
could get, and it proved in fact very useful (see below). Of course, a theo- 
retical homology model will always be inferior to an experimental crystal 
structure, but in the absence of the latter, it can provide a reasonably good 
basis for structure-assisted drug discovery [34, 35]. When we had deter- 
mined the crystal structure of the SARS-CoV MP’? another 10 weeks later 
[19], the overall r.m.s. deviation for all Ca atoms of the protein between 
theoretical model and experimental structure was found to be as high as 
3.8 A. However, this unreasonably high value is only caused by large devia- 
tions of the last four residues at the flexible C-terminus, which deviate by 
more than 12 A, and by slight differences of the mutual orientation of the 
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three domains. If the models for the individual domains are compared with 
the crystal structure, and only this is a fair comparison in view of the large 
flexibility of the enzyme, the r.m.s. deviation is down to little more than 1 A. 

Whatever the view taken on this issue, the homology model served pri- 
marily one purpose, and that was to facilitate the design of anti-SARS 
inhibitors directed at the main proteinase. With the template of the peptidyl 
CMkK inhibitor — as bound to the TGEV MP’? - at hand, we compared its 
binding mode to those seen for all other inhibitors of cysteine proteinases 
that occupy the N-terminal half of the substrate binding site (subsites Sn- 
S1), as long as atomic coordinates were available in the Protein Data Bank 
(PDB). We found that AG7088, a compound carrying a vinylogous ethyl 
ester instead of a CMK group and designed to block human rhinovirus 
(HRV) 3C proteinase [36], interacted in much the same way with its target 
enzyme than did our CMK inhibitor (Fig. 8) [18]. 


P2 = p-fluoro-benzyl: AG7088 (CAS number: 223537-30-2, Pfizer Inc.) 


As mentioned before, the picornavirus 3C proteinases (3CP'°) have a chy- 
motrypsin-related structure, reminiscent of domains I and II of the coron- 
avirus MP?°, although several of the secondary-structure elements are 
arranged differently, making structural alignment difficult with sequence 
identities < 10%. Also, they completely lack a counterpart to domain III of 
coronavirus MP's, Nevertheless, the substrate specificity of picornavirus 
3CP's [32, 37] for the P1', Pl and P4 sites is similar to that of the coron- 
avirus MP'°s (hence the original designation “3C-like” for the latter; see 
Tab. 1). Only parts of the two structures, TGEV MP’? in complex with the 
CMK inhibitor and HRV-2 in complex with AG7088, can be spatially 
superimposed (Fig. 8; r.m.s. deviation of 2.10 A for 134 pairs of Ca posi- 
tions out of the ~ 180 residues in domains IJ and II). Both inhibitors, the 
hexapeptidyl CMK and AG70838, bind to their respective target proteinas- 
es through formation of an antiparallel B-sheet with strand ell. However, 
completely different segments of the polypeptide chain interact with the 
substrate analogues on the opposite site: residues 188 to 191 of the loop 
connecting domains II and III in MP’, as opposed to the short B-strand 126 
to 128 in HRV 3C?P"°. As a result, the architectures of the S2 subsites are 
entirely different between the two enzymes; hence, the different specifici- 
ties for the P2 residues of the substrates (Leu versus Phe). The inhibitor 
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AG7088 has a p-fluorophenylalanine side chain (p-fluorobenzyl) in this 
position. Based on molecular modeling, we proposed that this side chain 
might be too long to fit into the S2 pocket of coronavirus MP'°, but an 
unmodified benzyl group would probably fit, as evidenced by Phe occur- 
ing in the P2 position of the C-terminal autocleavage site of the SARS 
coronavirus enzyme (see Tab. 1). Apart from this difference, the superim- 
position of the two complexes (Fig. 8) suggested that the side chains of 
AG7088 binding to subsites S1 (lactone derivative of glutamine) and S4 
(5-methyl-isoxazole-3-carbonyl) can be easily accommodated by the coro- 
navirus MP'°. Thus, we proposed that AG7088 could serve as a starting 
point for modifications which should quickly lead to an efficient and 
bioavailable inhibitor for coronavirus main proteinases. In fact, such mod- 
ifications have been applied (work mostly unpublished so far; a published 
proposal for modification is found in [38]) and we will see a number of 
good inhibitors designed on this basis appear in the literature in the next 
few months. 

Another possible approach to inhibiting the SARS-CoV main pro- 
teinase could make use of the insight into the importance of the N-finger in 
activating the enzyme. One could screen for competitve inhibitors that pre- 
vent MP’° dimerization through binding to the N-finger docking site, but 
without activating the Glul66/Phe140 switch. Such an approach would 
open a second front in addition to active site-directed inhibitors, in the 
urgent search for substances efficiently blocking the SARS coronavirus. 

Several groups have proposed to use HIV-1 proteinase inhibitors as 
starting points for designing inhibitors directed at the SARS-CoV MP", 
However, many of these studies remained at the modelling level and failed 
to prove the usefulness of the HIV-1 proteinase inhibitors experimentally 
[39, 40]. Some of the reports did not even seem to acknowledge the fact 
that the enzymatic mechanisms of HIV-1 proteinase, an aspartic pro- 
teinase, and SARS-CoV main proteinase, a cysteine proteinase, are very 
different. One of the few studies reporting a clinical usefulness of the appli- 
cation of HIV-1 proteinase inhibitors was published by Peiris et al. [41]. 
According to their results, a combination of the HIV-1 proteinase inhibitor 
kaletra and ribavirin was more effective than ribavirin alone, which, how- 
ever, was basically inactive, as shown later [42]. Based on the present struc- 
tural knowledge, it is not obvious how HIV-1 proteinase inhibitors should 
block SARS-CoV. 

In conclusion, prospects for discovering and developing inhibitors of the 
SARS-CoV main proteinase are good, given the structural knowledge that 
is available for the target today. It should also be emphasized that the crys- 
tal structures of the various MP'°s allow very efficient virtual screening [43] 
for existing organic compounds that might bind and inhibit the target. This 
approach is currently yielding promising inhibitors, which also exhibit anti- 
SARS activity on virus-infected Vero cells (Jiang, personal communica- 
tion). 
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Conclusions 


The three-dimensional structures presented here for coronavirus main pro- 
teinases provide a solid basis for the design of anti-coronaviral drugs. The 
binding modes of peptidic inhibitors and substrates have been revealed by 
the crystal structure of TGEV MP'° in complex with a hexapeptidyl 
chloromethyl ketone, and, within limitations, of the complex with the 
SARS-CoV MP’? and the same inhibitor. The structure of the SARS-CoV 
enzyme was determined at three different pH values and revealed a pH- 
dependent activation switch for the MP'°. The pH-activity profile of the 
main proteinase was shown to be governed by the protonation and depro- 
tonation of two histidine residues in the substrate-binding site. The confor- 
mational changes observed also suggested a mechanism for the self-activa- 
tion of the MP'° by autocleavage from the viral polyprotein. On the basis of 
structural comparisons between the TGEV MP"? in complex with a hexa- 
peptidyl CMK inhibitor and the HRV 3C proteinase in complex with the 
vinylogous ethyl ester AG7088, it has been proposed that whereas the lat- 
ter would not be a good inhibitor of the SARS-CoV MP'°, it should be a 
good starting point for designing such inhibitors. Such design studies are 
being performed at the moment and will start to yield potent anti-SARS 
compounds in the next few months and years. 
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